Automatic Annotation in Multirelational Information Networks
نویسنده
چکیده
Many networks are completely encapsulated using a single node type and a single edge type. Often a more complicated model composed of multiple distinct node and edge types can be constructed to create a more informative network [2]. We call the former homogeneous networks and the latter heterogeneous. The ability to homogenize networks varies wildly, dependent on the network under analysis and the problem being solved. We present a class of networks multirelational information networks where the heterogeneous structure is necessary for performing node classi cation and detecting missing information in the network. A multirelational information network is a network G with nodes V that map to real world objects and concepts which we call entities, and with edges E which represent relations between these entities. We use nodes and edges interchangeably with entities and relations. For the multirelational networks we will consider, there exist a true vertex labeling function lv : V → 2\{}, where Σ is an alphabet of node types and an observed vertex labeling function l̂v : V → 2. Informally, entity types will exist in a hierarchy and we require that the labeling only map a vertex to a set of types T such that for any pair (ti,tj), i 6= j, ti is a descendant of tj in the hierarchy or vice versa. Similarly, there exists labeling functions l̂e and le for the edges, but we will not be exploring the existence of an edge hierarchy in this work. Our goal is to learn the true vertex labeling function from the observed multirelational information network and labels. Speci cally, given network G = (V,E), hierarchy T , and observed labeling functionl̂v, we will learn the true labeling function lv. Additionally, we predict missing data elds in nodes that have incomplete relations. Namely, the edgeset E that we observe is not the true and complete set of relations that exist in the world. For example, the Mustang entity and the Camaro entity have many features in common, but only the Camaro has the model years relation. We would like to automatically recommend relations that may exist for a node based on our estimation of the true set of relations E∗. More precisely, given G = (V,E), we would like to provide A : V− > E s.t. A takes a node v and outputs a subset of edges that it should be incident upon v. The heterogeneous structure of the network is vital to this task, as we are essentially determining the missing pieces of the heterogeneous structure. Being able to discover entity types and missing relations will provide a method for database maintainers to discover and correct missing information in their entities. We can directly present our algorithm to database providers to automatically discover incompleteness in their data. We will provide a speci c example of usefulness in 2.1.
منابع مشابه
Tags Re-ranking Using Multi-level Features in Automatic Image Annotation
Automatic image annotation is a process in which computer systems automatically assign the textual tags related with visual content to a query image. In most cases, inappropriate tags generated by the users as well as the images without any tags among the challenges available in this field have a negative effect on the query's result. In this paper, a new method is presented for automatic image...
متن کاملFuzzy Neighbor Voting for Automatic Image Annotation
With quick development of digital images and the availability of imaging tools, massive amounts of images are created. Therefore, efficient management and suitable retrieval, especially by computers, is one of themost challenging fields in image processing. Automatic image annotation (AIA) or refers to attaching words, keywords or comments to an image or to a selected part of it. In this paper,...
متن کاملA CAD System Framework for the Automatic Diagnosis and Annotation of Histological and Bone Marrow Images
Due to ever increasing of medical images data in the world’s medical centers and recent developments in hardware and technology of medical imaging, necessity of medical data software analysis is needed. Equipping medical science with intelligent tools in diagnosis and treatment of illnesses has resulted in reduction of physicians’ errors and physical and financial damages. In this article we pr...
متن کاملAutomatic Colorization of Grayscale Images Using Generative Adversarial Networks
Automatic colorization of gray scale images poses a unique challenge in Information Retrieval. The goal of this field is to colorize images which have lost some color channels (such as the RGB channels or the AB channels in the LAB color space) while only having the brightness channel available, which is usually the case in a vast array of old photos and portraits. Having the ability to coloriz...
متن کاملImprovement of generative adversarial networks for automatic text-to-image generation
This research is related to the use of deep learning tools and image processing technology in the automatic generation of images from text. Previous researches have used one sentence to produce images. In this research, a memory-based hierarchical model is presented that uses three different descriptions that are presented in the form of sentences to produce and improve the image. The proposed ...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2011